Op-cbio120168 2615..2623
نویسندگان
چکیده
Motivation: There is growing momentum to develop statistical learning (SL) methods as an alternative to conventional genome-wide association studies (GWAS). Methods such as random forests (RF) and gradient boosting machine (GBM) result in variable importance measures that indicate how well each single-nucleotide polymorphism (SNP) predicts the phenotype. For RF, it has been shown that variable importance measures are systematically affected by minor allele frequency (MAF) and linkage disequilibrium (LD). To establish RF and GBM as viable alternatives for analyzing genome-wide data, it is necessary to address this potential bias and show that SL methods do not significantly under-perform conventional GWAS methods. Results: Both LD and MAF have a significant impact on the variable importance measures commonly used in RF and GBM. Dividing SNPs into overlapping subsets with approximate linkage equilibrium and applying SL methods to each subset successfully reduces the impact of LD. A welcome side effect of this approach is a dramatic reduction in parallel computing time, increasing the feasibility of applying SL methods to large datasets. The created subsets also facilitate a potential correction for the effect of MAF using pseudocovariates. Simulations using simulated SNPs embedded in empirical data— assessing varying effect sizes, minor allele frequencies and LD patterns—suggest that the sensitivity to detect effects is often improved by subsetting and does not significantly under-perform the Armitage trend test, even under ideal conditions for the trend test. Availability: Code for the LD subsetting algorithm and pseudocovariate correction is available at http://www.nd.edu/ glubke/code.html. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online. Received on March 7, 2012; revised on July 25, 2012; accepted on
منابع مشابه
Catalysis in ionic liquids.
2.5. Stabilization of IL Emulsions by Nanoparticles 2623 3. Hydrogenations in ILs 2623 3.1. Hydrogenation on IL-Stabilized Nanoparticles 2623 3.1.1. Hydrogenation of 1,3-Butadiene 2623 3.1.2. Hydrogenation of Alkenes and Arenes 2624 3.1.3. Hydrogenation of Ketones 2624 3.2. Homogeneous Catalytic Hydrogenation in ILs 2624 3.3. Hydrogenation of Functionalized ILs 2625 3.3.1. Selective Hydrogenati...
متن کاملRole of DFNA5 in hearing loss and cancer – a comment on Rakusic et al
License. The full terms of the License are available at http://creativecommons.org/licenses/by-nc/3.0/. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. Permissions beyond the scope of the License are administered by Dove Medical Press Limited. Information on how to request permission may be f...
متن کاملKeyword Index for Volume 110
ABCB1 1673 acceleration 2217 accreditation 850 actinic keratosis 520 active surveillance 2405 activin signalling 2604 acute myeloid leukaemia 783 ADAM9 2945 ADAMs 1535 adaptive design 1950 ADCC 1221 adenocarcinoma 1571, 2688 adherence 831 adhesion 146, 753 adjuvant chemotherapy 313 adjuvant therapy 1133, 1545 adolescent 1342 advanced adenoma 1228 advanced biliary tract cancer 2165 advanced neop...
متن کاملRED CELLS, IRON, AND ERYTHROPOIESIS Mature erythrocyte membrane homeostasis is compromised by loss of the GATA1-FOG1 interaction
GATA1 plays essential roles in erythroid gene expression. The N-terminal finger of GATA1 (GATA1-Nf) is important for association with FOG1. Substitution mutations in GATA1-Nf, such as GATA1V205M that diminish the GATA1-FOG1 association, have been identified in human thrombocytopenia and anemia cases. A mouse model of human thrombocytopenia has been established using a transgenic complementation...
متن کاملArrhythmia/Electrophysiology Intrinsic Cardiac Nerve Activity and Paroxysmal Atrial Tachyarrhythmia in Ambulatory Dogs
Background—Little is known about the relationship between intrinsic cardiac nerve activity (ICNA) and spontaneous arrhythmias in ambulatory animals. Methods and Results—We implanted radiotransmitters to record extrinsic cardiac nerve activity (ECNA; including stellate ganglion nerve activity and vagal nerve activity) and ICNA (including superior left ganglionated plexi nerve activity and ligame...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012